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ABSTRACT 

TJiis report is a reference manual for a programming language 
developed at Stanford Research Institute for the Scientific Data Systems 
940 computer. The compiler is now fully operational; it is written in 
its own language, compiles itself, and is in daily use for development 
of our CRT- display service system. 

Tlie name MOL940 (or simply MOL) , is an acronym for 
"Machine-Oriented Language." MOL is an ALGOL- like language with natural 
extensions for bit manipulation. The added syntax strongly reflects the 
internal design of the SDS 940, in accordance with the name MOL, 

Tlie introduction to this report includes a brief summary of other 
projects of the same nature which were known to the authors. There is 
also a discussion of the design criteria that shaped the MOL, The major 
topics are the comprehensibility of programs written in the language, 
the needs of system programmers working within a time-sharing system, 
and the effects on coding that result from using an on-line CRT. 

A complete definition of the language is given, using an extended 
Backus Normal form; included are semantic explanations and examples, a 
sample program, and some examples of code produced by the MOL compiler. 
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FOREWORD 



Development of the MOL (Machine-Oriented Language) began in October 

I 
1966 under joint sponsorship of the Advanced Research Projects Agency, 

Defense Department, and NASA, Langley Research Center. Although completion 

took approximately one year, only six man months have been invested in the 

project. The Augmented Human Intellect (AHI) Program (ENGELBARTl) is using 

the MOL as the base language for its software effort. The language and 

compiler have been explicitly designed to facilitate concurrent modification 

and development of AHI programming techniques. 

This report has bean prepared with the On- Line Text Manipulation 
System, and consequently it dif feres in a few respects from other teclmical 
reports. All paragraphs are hierarchically numbered; certain paragraphs 
bear "names," and references appear as an author's name, perhaps with a 
sequence number, enclosed in parentheses. 



IV 
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1 Original computer language development was guided by the already 
existing formalisms of the numerical analysts. The machine- independent 
evolutionary direction of the problem-oriented languages has enhanced 
their algoritlimic and algebraic nature, but destroyed their usefulness 
as system program languages, 

la TJie concerns of system programmers such as efficiency, tight 
code, and bit manipulations require a different orientation. Machine 
independence and algebraic constructs are not discarded but are 
enhanced; additional features are included to permit succinct, 
explicit references to hardware functions necessary in systems 
programming on a display-oriented time-sharing computer, 

2 Erwin Book (See B00K1) of System Development Corporation supplied the 
original impetus for our new language with his Q-32 machine-oriented 
language (MOL). Niklaus Wirth simultaneously undertook a similar 
project while at Stanford University. His PL-3bO (See WIRTlil) was 
designed as a precedence grammar (See WIRTH2) and used to implement a 
version of ALGOL on the IBM 3bO, 

3 Our aim throughout the development of MOL 940 was to design a 
coordinate language- compiler pair that permits the expression of clear, 
concise algorithms and the production of efficient, tight code. With 
such a language, fewer bugs slip in during coding, programmers can say 
what they want in fewer words, and (with a little luck) one can pick up 
some of his year- old code and understand it, 

3a Algorithmic clarity is mainly due to the structure implicit in 
the syntax of the language. 

3a1 It is significant in this regard that labels have almost 
disappeared in existing MOL code. Instead the CASE and WHILE 
statements are the primary means of controlling program flow (See 
WIRTH3), The program is not interleaved with many GOTO statements 
transferring into and out of sections of code so that only the 
original programmer can remember all the ways a certain statement 
may be reached. Just the way MOL code appears on a page makes the 
algorithmic flow clear (See SCHORREl). 

3a2 The succinctness of infix notation rather than assembly 
language also adds clarity. It is often quite difficult to pick 
up a random page of machine code and recognize that a set of five 
lines doing very strange things are actually testing for a flag in 
a word, but it is very easy to recognize a line of MOL and "see" a 
test being made, 

3b Our concern for the production of tight code led us to believe 
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that programmer and compiler must work together; the compiler alone 
cannot do the job. 

3bl While the programmer can do the job alone, it is usually too 
time consuming. The idiosyncrasies of the SDS 940 are reflected 
in the special constructs incorporated in the MOL which allow the 
programmer control of the code that is generated and the way in 
which registers are used. 

3c We have also included rather general expressions and assignment 
statements in the MOL. At times the programmer has no need for tight 
code and should be able to use the MOL on a higher level, leaving all 
the worrying about final constructs to the compiler, 

3d A unique consideration within the MOL design criteria is the 
accommodation of potential coordination between the structure of the 
language and a display- oriented time- sharing text editor. 

3dl With such a system, there may not exist hard copy and the 
programmer would be able to see no more than some twenty lines of 
his program at any one time. 

3d2 Much can be done to ease the programmer's movements within 
the code to facilitate manipulation of logical chunks of code and 
to allow at least everything that can be done with cards and a 
listing. 

3d3 We would like to give the programmer even more for we feel 
that our text structure conventions and the associated features of 
NLTS can be used for algorithm analysis; these techniques, coupled 
with the design of the language and of the compiler, provide 
greater power and facility for dealing with program design than 
more conventional methods such as flow charts. In ESDI we 
presented some basic discussion in this direction (See 
ENGELBART5) . 

4 The MOL 940 compiler uses a META compiler parser and a general 
operator-operand stack for the code producing algorithm. Additions to 
the syntax take only minutes to implement. As a result we do not try to 
plan for all future constructs. Instead, our attitude of restraint 
means that syntax is added when the need arises and the style of the 
construct is well thought out. 



MOL 940 — SECTION II: DEFINITIONS 



1 Terminology 

la The syntax for the MOL language is written in the META II 
notation. This provides an easy means of expressing the syntax in a 
form that is readable by both man and machine, yet allows great ease 
and flexibility in modifying the constructs that describe the 
language. 

lal The notation used for the META II syntax, as well as for the 
MOL language, is quite similar to the notation used in the ALGOL 
bO report. 

1a2 Terminal symbols are represented as strings of characters 
bounded by quotes. Nonterminal symbols take the form of an ALGOL 
identifier (i.e., a letter followed by a sequence of letters or 
digits). 

Ia2a Any terminal symbol consisting of a single character may 
be preceded by a single quote rather than enclosed in quotes to 
indicate that it is terminal. 

1a3 Concatenation is designated by writing items consecutively. 
The itens are separated by slashes to indicate alternation. Each 
syntax equation ends with a semicolon. 

1a4 A special syntactic entity represented as ".empty" has been 
incorporated to indicate that a syntactic element is optional, and 
is usually used in conjunction with alternation. 

laS Also, it is possible to "factor" part of a syntax equation; 
that is, parentheses can be used to group a sequence of items so 
as to treat it as a single item. 

lab A special operator, m$n (where m and n are optional 
integers) , is also used to designate "any number between m and n 
of occurrences of the following item." The default values of m 
and n are zero and infinity. This makes it possible to reduce the 
number of equations needed to obtain recursion on some item in the 
syntax. For example, the standard definition of identifier now 
becomes : 

laba identifier « letter $ (digit / letter);. 

lb The design philosophy for the MOL compiler was to follow tlie META 
II design, i.e., that of recursive recognizers. The reasons for this 
choice center around the following considerations: 

Ibl Most of the people using and designing the MOL language and 
compiler have had direct experience writing recursive recognizer 
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compilers. 

1b2 To design a precedence grammar and compiler means that the 
relationship between each character and all other characters has 
to be considered at each point, and an arbitrary construct cannot 
be added at will without possibly affecting the rest of the 
existing relations. 

2 Basic Symbols and Syntactic Entities 

2a General Vocabulary 

2al Terminal Vocabulary 

2ala ABCDEFGHIJKLMNOPQRSTUVXYZl 234 
5678 9 0()4.«$*: = -*;t, ./<>[] 

2a1b AND BEGIN BUNiP BY CALL CASE DECLARE DO DO-SINGLE ELSE END 
ENDP. ENTRY EXECUTE EXTERNAL FINISH FOR FROZEN GO GOTO IF INC 
NOT NULL OF OR POP PREFIX PROCEDURE PROC RETURN SET STEP THEN 
TO UNTIL VIRTUAL WHILE .A .E .V .LT .LE .EQ .NE .GE .GT .CB 
.NCB .AR .BR .XR .BRS .LSH .LCY .LRSH .RCY .RSH 

2a2 Nonterminal Vocabulary 

2a2a <abxreg> <actual> <actl> <act2> <act3> <act4> <address> 
<arpas> <assign> <band> <bexp> <block> <bor> <bound> <builtin> 
<bump> <call> <case> <constant> <cvar> <decl> <declaration> 
<entry> <equ> <equl > <exp> <ext> <exu> <factor> <for> <formal> 
<forral> <form2> <form3> <form4> <frozen> <frzl> <goto> <icon> 
<if> <immediate> <index> <indirect> <intersection> <item> 
<iterative> <labeled> <negation> <null> <parid> <prefix> 
<primary> <procedure> <product> <relation> <retum> <simple> 
<statement> <sum> <union> <value> <variable> <varfun> <virtual> 
<while> 

2b Primitives 

2bl Identifiers: An identifier is a symbol used to name a 
quantity (such as a procedure, a variable, or an array) , as a 
label or formal parameter. 

2bla Syntax: id » letter $5(letter / digit);. 

2b1b Semantics: An identifier (or more simply an id) is a 
string of letters and digits, with a maximum length of 6, the 
first of which must be a letter. 

2blbl All identifiers that are local to a procedure must be 
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declared at the beginning of the procedure. Those variables 
not declared or used as labels are assumed to be virtual, 
i.e., defined in some other procedure. No distinction is 
made among array, procedure, and label uses of identifiers. 

2blc Examples of Identifiers: 

2blcl I 

2blc2 CHAR 

2blc3 X21 

2blc4 I2J1JY1 

2b2 Numbers 

2b2a Syntax: number = 1$8 digit ("b" / .empty) ;. 

2b2b Semantics: A number is a string of digits, with a 
maximum length of eight characters, possibly terminated with a 
letter b. If the terminating character is a b, then the number 
is taken to be octal; otherwise it is taken to the base 10. 

2b2c Examples of Numbers: 

2b2c1 1 

2b2c2 1'024 

2b2c3 77770000b 

2b3 Strings 

2b3a 8-bit character strings are the only strings recognized 
by the MOL compiler, and these can only occur in declarations. 

3 Declarations: All declarations occur at the start of a procedure, as 
declarations are not allowed within a block. All variables declared in 
a procedure become local to that file (not just the procedure) , and 
external to that file, if so declared. Variables can be preset, arrays 
declared, and virtual symbols specified. 

3a Procedure: The procedure is the basic syntactic entity, in that 
one writes procedures, which are compiled, assembled, and loaded. 

3al Syntax: procedure « parid ("pop" "(" .num "," .num "," .num 
")" /.empty) (••procedure" /"proc'') formal ••;" $declar labeld $(••;'• 
labeld) ••endp."; 
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3ala parid « "(" id ")" ; 

3alb formal « "(" ( id / .empty) $2("," ( id / .empty)) '•)" ;. 

3a2 Semantics: The procedure declaration begins with an 
indentifier which serves as the name of the procedure. 
Optionally, one can declare a procedure to be a "POP" procedure so 
that it will be treated by the system as a user POP, 

3a2a Following the word "procedure" one optionally indicates 
the parameters to this procedure, A maximum of 3 is allowed, 
to correspond to the A, B, and X registers, which are the only 
arguments passed when a call to a procedure is made. These 
parameters are indicated by placing them after the word 
"procedure," and enclosing them in parentheses. 

3a2b After the procedure declaration comes a declaration of all 
the variables that are to be used in that procedure, their 
dimensions (if any) and their values if they are being preset, 

3a2c The sequence of statements that constitutes the 
executable code of the procedure follows these declarations. 
In this, note that one cannot declare variables within blocks, 
and that variables can only be declared at the beginning of a 
procedure, 

3a2d Finally, all procedures must end with an "endp", 

3a3 Example of Procedure: 

3a3a (get) procedure (x, i) ; declare x,i; return([x[i+l J]t4) 
endp, 

3b Declaration 

3bl Syntax: declaration = ( decl / ext / equ / virtue / frozen / 
prefix ) ";" ;. 

3c Decl 

3cl Syntax: decl = "declare " ("external " /.empty) item $("," 
item) ; 

3c1a item = .id (bound /.empty) (value /.empty) ; 

3clb bound » "[" (.id /.num) "J" ; 

3clc value * "="( "(" icon $("," icon) ")" / icon ); 
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3cld icon = (.num /.id /.st8) ;, 

3c2 Semantics: The basic declaration statement permits 
declaration of those variables which are to be allocated in the 
current procedure (and possibly made external to the current file, 
to indicate their dimensions (if arrays) , and to specify the 
values to which they are to be preset (numbers, addresses of 
identifiers, or strings). 

3c3 Examples: 

3c3a declare x,y,z[10]; 

3c3b declare external m=10,n-m,st=*end of file*; 

3c3c declare sk[10]=(0, 1,20, 40);. 

3d External 

3dl Syntax: ext « "external " evar $("," evar) ; 

3dla evar = .id ;. 

3d2 Semantics: The external declaration generates "ext" records 
for the assembler — that is to say, those variables following the 
"external" are defined to be external to the current file, but 
they are not allocated any storage. In this last respect they 
differ from variables which are declared via the "declare 
external" statement. "External" is sometimes used to declare 
labels to be external. 

3d3 Example 

3d3a external m,n,z;. 

3e Equate 

3e1 Syntax: equ = "set " equl $("," equl) ; 

3ela equl « .id "=" (.id /.num) ;. 

3e2 Semantics: Tlie equate declaration generates "equ" records 
for the assembler— that is to say, those variables that are 
indicated are equated to the value given at assembly time. This 
is useful in generating conditional assemblies, and in setting the 
array bounds via a "set " identifier. 

3e3 Example: 
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3e3a set m940=l ,skinax«10U;. 

3f Virtual 

3fl Syntax: virtue « "virtual " cvar $("," cvar) ; 

3fla cvar « .id (bound /.empty);. 

3f2 Semantics: If a variable is not declared in a file, then it 
is known as virtual. Via the "virtual" declaration, it is 
possible to tell the compiler which variables are expected to be 
virtual; appropriate checks can then be made, and when the 
cross-reference listing is generated, these variables will be 
marked "v" for virtual, instead of "u" for undefined, 

3f3 Example: 

3f3a virtual a,b,ra[32b];. 
3g Frozen 

3g1 Syntax: frozen « "frozen " frzl $("," frzl); 

3gla frzl a .id;. 

3g2 Semantics: The frozen declaration is used to tell the 
compiler that the following variables are local to this file, but 
that no storage should be allocated for the variables. Tliis 
distinction is needed because the codes for local and virtual 
variables are different. Since the loader links undefined symbols 
together through the address field, it is not possible to have a 
complex address field (such as " Ida m+1") for a virtual symbol. 
Thus for the compiler to generate the appropriate index register 
loads and the correct address field, it needs to know whether a 
variable is local or virtual. The frozen declaration is a way of 
making the compiler think that a variable is local when it is 
virtual. This is used in connection with the ARPAS "continue 
assembling", and "frozen symbol table" features, 

3g3 Example: 

3g3a frozen a,b,x;, 

3h Prefix 

3hl Syntax: prefix « "prefix " "for " ("generated " "labels:" 
.st8 /"temporaries:" .st8) ;. 

3h2 Semantics: By using a higher-level language, it is possible 
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to have the compiler generate labels and temporaries which, at the 
machine- language level, would otherwise have to be done by the 
user. However, the compiler is now generating labels and 
temporaries, using identifiers that are the same for each 
compilation. For debugging, and for generating reentrant code, it 
is useful to be able to specify different names. The "prefix" 
declaration permits the user to specify the names used for the 
generated labels and temporaries, 

3h3 Examples: 

3h3a prefix for generated labels: 'fmt'; 

3h3b prefix for temporaries: 'libet' ;• 

4 Expressions: An expression is an entity which represents a numerical 
value (contained in one 24-bit word) , This value is obtained by using 
the values of the identifiers and functions within the expression, and 
combining these values by means of the operators within the expression. 
Note that the symbols .ar, .br, and .xr are associated with the internal 
registers of the machine, and their values are the contents of the 
respective registers. 

4a Exp 

4a1 Syntax: exp « "if" bexp "then" bexp "else" exp / bexp ;. 

4a2 Semantics: A general expression can be either a conditional 
expression, using the "if then else" type of construct, or it may 
be an expression resulting from the combination of arithmetic. 
Boolean, or relational operators, 

4a3 Examples: 

4a3a if x .le y then 1 else 2 

4a3b X'»-y*z/(x+l) 

4b Bexp 

4bl Syntax: bexp « union;, 

4c Union 

4c1 Syntax: union *= intersection $ ("or" union) j. 

4c2 Semantics: The union makes it possible to combine expressions 
with the logical operator "or." The result of the "or" operator 
is true (i.e. not equal to zero) iff at least one of the 
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expressions is true. 

4c3 Example: 

4c3a X or y 

4d Intersection 

4d1 Syntax: intersection = negation $( "and" intersection);. 

4d2 Semantics: The intersection makes it possible to combine 
expressions with the logical operator "and," If both expressions 
are true, then the result will be true. 

4d3 Example: 

4d3a X and y 

4e Negation 

4el Syntax: negation « "not" negation / relation;. 

4e2 Semantics: This construct makes it possible to take the 
(logical) negation of the value of any expression. 

4e3 Example: 

4e3a not x 

4f Relation 

4fl Syntax: relation = sum (".gt" sum /",ge" sum /".ne" sum 
/".eq" sum /".le" sum /".It" sum /".cb" sum /".neb" sum /.empty ); 

4f2 Semantics: Tlie relational operators make it possible to 
construct logical statements which are true if the given arguments 
stand in the specified relation to one another. The operators are 
"greater than," "greater than or equal," "not equal," "less than 
or equal," "less than," "common bits," or "no common bits." The 
"common bits" operator yields a value of true iff both of its 
arguments have ones in any corresponding bit positions. The "no 
common bits" operator yields a value of true iff its arguments do 
not have ones in any corresponding bit positions. 

4f3 Examples: 

4f3a m ,gt n 

4f3b z .ne y 



10 



MOL 940 — SECTION II: DEFINITIONS 

4f3c X .cb y 

4g Sum 

4g1 Syntax: sum * product $(•'♦" product / "-" product);. 

4g2 Semantics: The sum permits one to combine expressions with 
the arithmetic operators •♦■ and - , Note that all values are taken 
to be 2 4 -bit integers, 

4g3 Examples: 

4g3a X 

4g3b X + y 

4g3c X - y •»• z 

4h Product 

4h1 Syntax: product = factor $(••*»• factor / "/" factor / "t" 
factor) ; • 

4h1a Syntax: factor « bor / "-" factor ; 

4h2 Semantics: Tlie product permits one to combine expressions 
with the arithmetic operators * (times), / (division), and t 
(mod). The result of these operators is a 24-bit integer, and in 
the case of the division, the remainder is discarded. Mod 
operates similarly to division except that the quotient is 
discarded and the remainder is the result of the operation, 

4h3 Examples: 

4h3a X 

4h3b X * y 

4h3c X / y 

4h3d X t y 

4i Bor 

4il Syntax: bor = band $(",v" band / ",x" band);, 

4i2 Semantics: The "bor" (standing for "bit or") makes it 
possible to obtain the bitwise "or" of two expressions. Both 
inclusive and exclusive "or" are allowed and are designated by ,v 
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and ,x respectively. 
4i3 Examples: 

4i3a X 

4i3b X ,v y 

4i3c X ,x y 

4j Band 

4jl Syntax: band = primary $('',a" primary);, 

4j2 Semantics: 'fhe "band" (standing for "bit and") makes it 
possible to obtain the "bit and" of two expressions, 

4j3 Examples: 

4j3a X 

4j3b X .a y 

4k Primary 

4kl Syntax: primary « bltin / abxreg / varfun / const /"(" exp 
")" / immed / indir ; 

4k1a bltin = ((".Irsh" "(" actual ")" .num /",lsh" "(" actual 

")" .num / ".rsh" "(" actual ")" .num / ".rcy" "(" actual ")" 

.num / ".rcy" "(" actual ")" .num ) (",2" /.empty)) /".brs" 
.num "(" actual ")" ; 

4k1b abxreg = ".ar" /".br" /".xr" ; 

4klc varfun « .id ("[" index "]" /"(" actual ")" /.empty); 

4kld const » .num; 

4kle immed « "$" (var / const ("[" index "]" /.empty)); 

4k1f indir « "[" (immed /var /const) "J" ; 

4klg var = .id ("[" index "]" /.empty); 

4klh index = "(" exp ")" / .num /(.id /".xr")("+" .num/"«" 
.num/.empty) ; 

4kli actual = ( .id / .empty ) $2( "," ( .id / .empty)) 
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4k2 Semantics: The primary consists of the basic entities that 
can be used to construct an expression. It provides for direct 
reference to the A, B, and X registers, use of the shift and cycle 
instructions with optional tagging, use of the BRS instruction, 
indexed variables, functions of up to three arguments, and both 
indirect and immediate addressing. Note that by means of the 
parenthesis, recursion is introduced, and thus complex 
expressions may be constructed from simpler ones, 

4k3 Examples: 

4k3a X 

4k3b x[i+1] 

4k3c 23 

4k3d pac(x,y) 

4k3e (x + y) 

4k3f [x] 

4k3g $x 

4k3h ,lsh(m, 0,6)3 + ,rsh(a,b,x)5,2 

5 Statements: A statement is the basic executable unit of aii MOL 
program. It denotes some action that is to be performed, which action 
may be the evaluation of expressions or the execution of other 
statements, 

5a Syntax: labeld = (parid •':" /, empty) stat ; 

Sal Stat = if / simple ; 

5a2 simple = block / goto / return / call / rcall / bump / arpas 
/ iterat / entry / case / null / exu / assign ;, 

5b If 

5bl Syntax: if = "if " bexp ("then " simple ("else " stat 
/•empty) /"do-single " stat);, 

5b2 Semantics: The "if" construct is the standard if statement 
with the optional "else" part. The added construct "do-single" 
indicates that the true part will consist of just one instruction 
and thus the code at the end of the test for the "bexp" can be 
compiled to minimize the branch and skip instructions. 
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5b3 Examples: 

5b3a if x then goto 12 else x«<-l ; 

5b3b if X ,ne z do-single bump i;, 

5c Block 

5cl Syntax: block « "begin " labeld $(••;" labeld) "end";, 

5c2 Semantics: The "block" construct allows the user to delimit 
a sequence of consecutive statements by "begin" and "end" to 
indicate that it is to be treated as a single statement. Note 
that declarations are not permitted within a block, 

5c3 Examples: 

5c3a begin x**-! ; y^x*y+z; (here): return (y) end; 

5c3b begin call inchar(char) ; char-^-char ,a 77b end;, 

Sd Goto 

5d1 Syntax: goto « ("goto " /"go " "to ") addr ; 

Sdla addr « var / indir / immed / const ;. 

5d2 Semantics: The "goto" generates an unconditional branch. 
This branch can be indirect, indexed, direct, or immediate, 

5d3 Examples: 

5d3a go to here; 

5d3b goto [$tra[i+lj]; 

5d3c goto $15b;, 

5e Return 

5e1 Syntax: "return" (*( actual *) /.empty ) ;, 

5e2 Semantics: It is possible, via the "actual" construct, to 
indicate what the contents of the A, B and X registers should be 
when returning from a procedure. This is optional, and if nothing 
is specified the registers remain as affected by the procedure, 

5e3 Examples: 
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5e3a return ; 

5e3b return (resul t) ; 

5e3c retum(in[i-2]-y,,ra+l);, 

5f Call 

5fl Syntax: "call " var ( •( actual •) / .empty ) ;. 

5f2 Semantics: The optional arguments following the "call " 
indicate the contents of the A, B and X registers of the 940, 
Thus it is possible to pass up to 3 arguments at call time to a 
procedure. Also, it is possible to subscript the name of the 
procedure being called, thus indicating an alternate to the 
declared entry point, 

5f3 Examples: 

Sf3a call sub; 

5f3b call output(char .a 77b,,filen); 

5f3c call table[i3(arg1,10*arg2);. 

Sh Bump 

5hl Syntax: bump « "bump " addr $("," addr );• 

5h2 Semantics: There is an instruction on the SDS 940 which adds 
1 to memory, and leaves the contents of the central registers 
unchanged. The "bump " construct indicates that this operation is 
to be performed on the sequence of items that follow the "bump," 

5h3 Examples: 

5h3a bump i; 

5h3b bump m[i-3],$1 ,[$stackp] ;, 
5i Arpas 

5il Syntax: arpas = "<" <copy across everything up to the next> 

llStf . 

5i2 Semantics: This construct allows the user to insert machine 
code into an MOL program, if some special sequence of code that is 
needed cannot be generated or even expressed by the language. 
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5i3 Examples: 

5i3a < sta temp>; 

5i3b < cio fnumo; tco cr; tco If; brs 10>;, 

5j Iterat 

5jl Syntax: iterat « for / while;. 

5k For 

5kl Syntax: for = "for " .id "from " exp ("inc " /"dec ") exp 
"to " exp "do " Stat ;, 

5k2 Semantics: The "for" statement provides a means of repeating 
a statement (or a block of statements) a specified number of 
times. By requiring the user to specify "inc" and "dec" it is 
possible to generate the appropriate code without complicated 
runtime or compile time computations. The limits on the for loop 
are not recomputed each time through the loop, but are computed 
once at the start. Note, however, that if an identifier is used 
as a limit, then the value of this identifier is used as the check 
each time, so that changing the value of this identifier will 
affect the "for" loop. 

5k3 Examples: 

5k3a for i from 1 inc 1 until n do iLij-^-O; 

5k3b for j from x+1 inc 1 to x*x do begin n[j]'Hn[j+1 j ; m[j]"«-0 
end;. 

51 IVhile 

511 Syntax: while « "while " exp "do " stat ;. 

512 Semantics: The "while" statement provides a means of 
repeating a statement (which can be a block) as long as an 
expression is true. This expression is reevaluated after each 
repetition of the "while" statement. 

513 Examples: 

513a while char .ne cr do char^incharO ; 
513b i-t-l ; while i ,le n do begin m[i]"«-0; bump i end;, 
5m Entry 
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Sml Syntax: entry * "entry '• .id formal ;. 

5in2 Semantics: The "entry" statement provides a means of 
indicating secondary entry points in a procedure. Any calling 
arguments that are indicated are stored, and a branch around the 
code generated by the "entry" statement is provided by the 
compiler, so that an "entry" statement can be inserted at any 
point without causing an interruption in the existing code. 

5m2a The return address is moved from the entry point to the 
name of the procedure, so that all returns can return to the 
procedure name. However, this is not done in the case of a 
reentrant procedure, as the return address is placed elsewhere. 

5m3 Examples: 

5m3a entry subset; 

5m3b entry inset (argl , inch!) ;. 

5n Case 

5nl Syntax: case = "case " exp "of " "begin " stat $(*; stat ) 
"end";. 

5n2 Semantics: The "case" statement provides a means of 
executing one statement out of many, depending on the value of the 
expression controlling the case statement. The same thing has 
usually been done by a series of nested "if" statements. If the 
value of the expression specifies a statement that does not lie 
within the range of the case statement, (i.e., from 1 to n=number 
of statements in the "case") then the last statement of the case 
is executed, 

5n3 Examples: 

5n3a case n of begin 

call subl (n); 

return; 

call error end;. 

So Null 

501 Syntax: null « "null" ;. 

502 Semantics: The "null" statement is included in the language 
so that there may be statanents within the case statement which do 
nothing. 
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5p Execute 

5p1 Syntax: exu - "execute " addr ;• 

5p2 Semantics: This construct reflects the SDS 940 instruction 
which can execute another instruction. It provides a means of 
locating and executing this instruction with any appropriate 
address (i.e., with indirect addressing, index modification, 
etc.). 

5p3 Examples: 

5p3a execute m[i]; 

5p3b execute [$0]; 

5p3c execute 00220002b;. 

5q Assign 

5q1 Syntax: assign = (var /abxreg /indir / immed) $("," (var / 
abxreg / indir / immed)) *<- ("♦" /.empty ) exp ;. 

Sq2 Semantics: The "assign" statement provides a means of 
assigning values to variables, registers, and actual memory 
locations. Provision is made for multiple stores, in which case 
the stores are done in sequence from right to left. Also, if the 
item next to the •«- is a register, the value will be placed in that 
register, and the remaining assignments done from that register; 
otherwise the assignments are taken from the register that the 
value happens to be left in by the expression analysis. Note too 
that the construct '*^ is used to indicate that an "add to memory" 
is to be done rather than a store. This is a special meaning, and 
thus precludes the use of a unary plus, 

5q3 Examples: 

5q3a x-*-! ; 

5q3b m[i],l-H(x*b-c/d)*t; 

5q3c .ar,m,,br*-i'»-l ; 

5q3d m[i]'«H-,ar;, 



18 



MOL 940 — SECTION III: SYNTAX 

1 The following is the syntax for the MOL, Note that backup is 
required to compile, but the backup is only past an identifier after the 
next character has been recognized. This gets over a lot of problems 
concerning assignment statements and labels. 

2 prog s (.id /.empty) $(arpas *; / proc ) "finish" ; 

3 proc = parid ("pop" .sp *( ,num "," .num "," .num *) / .empty .rp 
.rr) ("procedure" /"proc") formal •; (.tp $decl2 /$declar) labeld $(•; 
labeld) "endp." ; 

3a parid = • ( .id ♦) ; 

3b foimal = •( (.id ("," forml / form4) /"," forml /foTm4 ) •) / 
form4 ; 

3bl forml = .id ("," foTm2 / foiroS) /"," form2 /fonn3 ; 

3b2 form2 « .id /fotm3; 

3b3 forms - .empty ; 

3b4 fonn4 « .empty ; 

4 declar - (decl / decl2 ) * ; ; 

4a decl2 = ext / equ / virtue / frozen / prefix ; 

4b decl = "declare" ("external" .rl /.empty .si) item $("," item); 

4bl item = .id (bound /.empty ) (value /.empty) ; 

4b2 bound = V'[" (.id /.num) "]" ; 

4b3 value = "=" ( •( icon $("," icon) •) / icon ) ; 

4b4 icon = (.num /.id /.st8 ) ; 
4c ext = "external " evar $("," evar) ; 

4cl evar = .id ; 
4d equ s "set " equl $("," equi) ; 

4dl equl = .id "=" (.id /.num ) ; 
4e virtue = "virtual " evar $(",*' evar) ; 

4el evar = .id (bound /.empty ) ; 
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4f frozen « "frozen " frzl $("/• frzl) ; 

4fl frzl « .id ; 

4g prefix « "prefix " "for " ("generated " "labels:" .st8 
/"temporaries:" .stS ) ; 

5 labeld = (parid ":" /.empty) stat ; 

5 Stat = if / simple ; 

7 if « "if " bexp ("then " simple ("else " stat / .empty ) /"do-single 
" stat); 

8 simple « block / goto / return / call / rcall / bump / arpas / 
iterat / entry / case / null / exu / assign ; 

9 block = "begin " labeld $('; labeld) "end"; 

10 goto = ("goto " /"go " "to ") addr ; 

11 return « "return" (•( actual •) /.empty ) ; 

12 call = "call " var ( •( actual •) / .empty ) ; 

13 bump = "bump " addr $("," addr ); 

14 arpas = "<" <copy across everything up to the next> ">" ; 

15 iterat = for / while; 

16 for = "for " .id "from " exp ("inc " .si /"dec " .ri) exp "to " exp 
"do " stat ; 

17 while - "while " exp "do " stat ; 

18 entry « "entry " .id formal ; 

19 case = "case " exp "of " "begin " stat $('; stat ) "end" ; 

20 null = "null" ; 

21 exu = "execute " addr ; 

22 assign = (var /abxreg /indir / immed) $("," (var / abxreg / indir / 
immed)) •t ("+" .sa /.empty .ra) exp ; 

23 exp « "if " bexp "then " bexp "else " exp / bexp; 
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24 bexp s union; 

25 union = inter $("or " union ); 

26 inter = neg $ ("and •' inter ) ; 

27 neg = "not " relat / rolat; 

28 relat « sum (".It " sum .re .rb /".le " sum .re .sb /".eq " sum 
.re .rb /".ne " sum .re .sb /".ge " sum .re .sb /".gt " sum .re .rb 
/".cb " sum .re .sb /".neb " sum .re .rb /.empty); 

29 sum = prod $("+" prod /"-" prod ) ; 

30 prod s factor $("*" factor /"/" factor /'-•- factor ); 

31 factor « bor /"-" factor ; 

32 bor « band $( ".v " band / ".x " band ); 

33 band « prim $(".a " prim ); 

34 prim = bltin / abxreg / varfun .se / const .se / '( exp ') / iramed / 
indir; 

35 abxreg = ".ar" / ".br" /".xr" ; 

3b bltin =((".lrsh" »( actual •) .num /".Ish" •( actual •) .num /".rsh" 
•( actual •) .num )(",2" /.empty)) /".brs" .num •( actual '); 

37 varfun = .id ("[" index "]" /♦( actual •) /.empty ); 

38 var = .id ("[" index "J" / .empty); 

39 index » '(exp •) .te /.num /(.id /".xr" ) ("♦" .num /"-" .num 
/.empty ) ; 

40 addr « var / indir / immed / const ; 

41 immed = "$" (var / const ("[" index "]" / .empty)) ; 

42 indir = "[" (immed /var /const ) "j" ; 

43 const = .num .se ; 

44 actual = .empty (exp ("," acti / act4) /"," actl /.empty act4 ) ; 
44a actl « exp ("," act2 / act3) /"," act2 /act3 ; 
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44b act 2 « exp /act 3 ; 
44c acts « .empty ; 
44d act4 = .empty ; 
45 synerr « $("endp." /) ; .end 



22 



MOL 940 — SECTION IV: OPERATION 

1 The MOL Executive 

1 a User Interface 

1a1 The MOL Executive is the interface between the user and the 
MOL compiler. It uses the command-recognition structure of the 
SUS 940 time sharing system itself, especially that of the QEI) 
subsystem. 

lala A special meaning is attached to certain control 
characters; when one of them is typed by the user, the 
remainder of the control word or phrase is echoed by the EXEC. 
Some characters represent commands to be performed, others 
represent flags requiring a yes/no type of answer, and others 
require file names, such as Input: /prog/, 

lalb Each command requires a period for confirmation. If any 
other character is typed, then a space and a question mark are 
echoed and the command is aborted. 

Ia2 The various characters recognized and their meanings are as 
follows: 

la2a (i) Input: "I" is typed to specify the input file for 
the MOL compiler. After the I has been typed, a file name 
should be given, followed by a period. 

1a2a1 An input file must be specified with each new 
compilation. This file will be closed when the compilation 
is finished. 

1a2b (o) Output: "OS is typed to specify the output file for 
the MOL compiler. After the "O" has been typed, a file name is 
expected and should be acknowledged by a period. 

Ia2b1 Each time the compilation process is initiated, the 
old output file is closed and the new one opened. If, 
however, the new output file name is the same as the last 
one used for output, or if none has been specified, then the 
last file is not closed and the next set of output is 
appended to the current output file, 

Ia2b2 It is possible to specify different files for output, 
should the wrong one be given. However, when execution of 
the compiler begins, the last file specified for output will 
be used. 

Ia2c (b) Begin Compilation: "B" is typed to indicate that all 
file names and flags have been specified for the current 
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compilation, so that compilation may now actually be initiated, 

la2cl If there is insufficient information (such as lack of 
file names) to initiate the compilation process, the command 
will be aborted, 

1a2c2 When a successful compilation has been performed, the 
message '•***end of compilation***" is typed. If control 
returns to the user without this message, then the 
compilation has not been completed because of an error 
condition (such as running, out of room on the RAD, or an 
illegal instruction trap from the compiler, etc.), 

Ia2d (z) Zap: "Z" is typed to terminate the MOL Executive and 
return control to the TSS Executive. IVhen "zap." is typed, any 
remaining files that are open are closed. 

Ia2e (1) Listing (interlinear): "L" is typed to set the flag 
controlling the interlinear listing. The expected response is 
either a "y" or "n" for "yes" and "no", respectively, although 
a period alone will be taken as a "yes" response. 

Ia2el When the interlinear listing is sent to any file 
other than the controlling Teletype, all semicolons are 
converted into $ so that ARPAS will not terminate a comment 
in the middle of the line. 

1a2f (t) Type Procedure Names: "T" is typed to set the flag 
which determines whether or not procedure names are typed on 
the controlling Teletype as they are compiled. If the flag is 
set, then as each procedure is encountered by the compiler, the 
name of the procedure is typed. The response to this command 
is in the usual "y" (yes) or "n" (no) manner. 

1a2g (c) Cross Reference: "C" is typed to request a 
cross-reference listing of the identifiers used in the input 
file. The response to this command is a file name that is to be 
used for the cross-reference listing, such as "Teletype". 

1a2gl This listing gives the names of the identifiers in 
alphabetical order, along with their status (undefined, not 
used, etc.) and an ordered list of the line numbers on which 
they are used. 



Ia2h (r) Reentrant: "R" is typed to set the flag that governs 
whether or not the compilation produces reentrant code. A "y" 
or "n" response for "yes" or "no" is expected and must be 
acknowledged with a period. 
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Ia2h1 If the response if yes, then the flag for "generate 
temporaries" (see below) is automatically set to "no", 

1a2i (g) Generate Temporaries: "G" is used to set the flag 
which specifies whether or not the temporaries used in the last 
input file are to be allocated at the end of the output file. 

Ia2il If this flag is on, the the temporaries are allocated 
(this is the usual case). If the flag is off (set by giving 
a "no" response), then the temporaries are not allocated. 
The latter is generally used when reentrant code is being 
produced, and then in connection with the "prefix for 
temporaries " declaration. 

Ia2j (k) Keep compiling: "K" is the same as "begin 
compiling," except that some parts of the MOL compiler are not 
reinitialized: 

la2jl These are the symbol table and the temporary- and 
generated- label counts. The purpose of this command is to 
provide a means of compiling one input file, and then 
another, as if they were all the same input file. 

Ia2k (q) Quick: "Q" causes the supression of the string which 
is normally echoed for each command character. 

1a21 (v) Verbose: "V" causes the printing of the string which 
gives the meaning for each character typed as a command. 

Ia2m Any other characters typed are illegal; the MOL Executive 
will respond with a space followed by a question mark. 

lb Error Recovery and Error Messages 

Ibl The only errors which should noimally be expected are syntax 
errors in the user's input file. 

Ibl a When such an error occurs, an appropriate error message 
is typed, along with the line number and line which caused the 
error. Also an uparrow is typed under the last character 
interpreted by the compiler. 

Ibib To attempt an error recovery, a scan is made for the next 
"endp.", stacks are reset, and an attempt is made to restart 
the compiler to look for a procedure, Tliis type of procedure 
has proven fairly useful, and is far better than just giving 
up. 

1b2 Another user error which may arise is the occurrence of 
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identifiers or numbers longer than the maximum length allowed (6 
and 9 respectively). In this case a warning message is tyj^ed, the 
remainder of the string is skipped, and compilation continues. 

Ib3 Next on the list of errors are stack and symbol-table 
over/underflow, 

lb3a All the stacks and symbol tables have been set to 
adequate sizes for most programs, and the normal user will 
never encounter the bounds. When and if they are exceeded, an 
error message to this effect is typed and the compilation 
process is terminated* 

lb4 Yet another, even more obscure, error is one caused by an 
illegal string passed to FMT (a routine internal to the MOL 
compiler). 

1b4a Such a string originates in the syntax equations 
themselves, and this error can only be the result of changes 
made in the syntax file of the compiler; when this is 
cross-checked by FMT, the error is detected. This is treated 
as a fatal error, and compilation ceases. But this error 
should never occur in the normal course of events, 

IbS Finally there are two types of errors from which there is no 
recovery at present, 

IbSa Internal conditions in the compiler, such as illegal 
memory references or illegal instructions, or program loops 
(hopefully none of these will ever occur), 

IbSb Conditions external to the compiler, such as running out 
of room on the RAD, or a rubout by the user, or a system 
crash. 
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1 (inchar) The "inchar" procedure is an inteimediate interface between 
the input medium and the compiler. 

la This routine buffers one line of text at a time, outputs it to 
the output file (if the list option is set) and returns the next 
character in the A register* 

lb "inchar" also has an entry point to print error comments to the 
controlling Teletype should any syntax error be detected, 

(inchar) procedure; 
declare nchar»80, mchar«80, maxch=80, line[80], i ; 
declare external list^l , nline=0, lf*153b, cr=155b, space=Ob; 
declare star=* **,arrow=' t*, peeked«0; 
if peeked then 
begin 

peeked-^O ; 

return (line [nchar]) end; 
if nchar .ge mchar 
then 
begin 

for i from inc 1 to maxch do 
begin 

line[i] -*- gench(); 
if .ar .eq If then goto ml end; 
mchar <- maxch; 
goto m2; 

(ml ) : mchar -^ i ; 
(m2) : if list then 
begin 

call putch(star); 

for i from inc 1 to mchar do call putch(line[ij) end; 
nchar *<■ o; 
bump nline end 
else bump nchar; 
retum(line[nchar]) ; 
entry (perr) ; 

call putch(star); 

for i from inc 1 to mschar do putch(line[i]) ; 
for i from inc 1 to nchar-1 do putch(space) ; 
call putch(arrow) ; 
call putch(cr); 
call putch(lf); 
return 
en dp. 
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